Dynamic Ontology Based Model for Text Classification
نویسنده
چکیده
Text Mining is the process of extraction of knowledge from large amounts of text. Information retrieval gains more focus due to tremendous increase of textual information in web servers. It is a very challenging task to process, organize, analyze and extract knowledge from huge volumes of unstructured text. Traditional text classification algorithms needs well defined text corpus to train and classify the given textual information. It is a very complex task to build text corpus with the help of thesaurus on the given text documents. To overcome this, ontology models can be considered for constructing knowledge base. Domain ontologies are considered for extracting more useful and high quality data on a particular domain. Many traditional approaches use static methods to construct domain ontologies. But, ontologies developed by these static approaches consist of limited terms in their knowledge base due to lack of updation. In this paper, we propose a dynamic ontology based model to classify the extracted terms and to build knowledge base on a particular domain. Experimental results show that our dynamic ontology based model performing excellently by frequent updating of extracted terms in our knowledgebase.
منابع مشابه
A Joint Semantic Vector Representation Model for Text Clustering and Classification
Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...
متن کاملArabic News Articles Classification Using Vectorized-Cosine Based on Seed Documents
Besides for its own merits, text classification (TC) has become a cornerstone in many applications. Work presented here is part of and a pre-requisite for a project we have overtaken to create a corpus for the Arabic text process. It is an attempt to create modules automatically that would help speed up the process of classification for any text categorization task. It also serves as a tool for...
متن کاملA New Document Embedding Method for News Classification
Abstract- Text classification is one of the main tasks of natural language processing (NLP). In this task, documents are classified into pre-defined categories. There is lots of news spreading on the web. A text classifier can categorize news automatically and this facilitates and accelerates access to the news. The first step in text classification is to represent documents in a suitable way t...
متن کاملA High-Performance Model based on Ensembles for Twitter Sentiment Classification
Background and Objectives: Twitter Sentiment Classification is one of the most popular fields in information retrieval and text mining. Millions of people of the world intensity use social networks like Twitter. It supports users to publish tweets to tell what they are thinking about topics. There are numerous web sites built on the Internet presenting Twitter. The user can enter a sentiment ta...
متن کاملخوشهبندی اسناد مبتنی بر آنتولوژی و رویکرد فازی
Data mining, also known as knowledge discovery in database, is the process to discover unknown knowledge from a large amount of data. Text mining is to apply data mining techniques to extract knowledge from unstructured text. Text clustering is one of important techniques of text mining, which is the unsupervised classification of similar documents into different groups. The most important step...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016